Dataset statistics
| Number of variables | 31 |
|---|---|
| Number of observations | 20000 |
| Missing cells | 8061 |
| Missing cells (%) | 1.3% |
| Duplicate rows | 1 |
| Duplicate rows (%) | < 0.1% |
| Total size in memory | 4.7 MiB |
| Average record size in memory | 248.0 B |
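The derived figures in the table above can be checked with simple arithmetic. The sketch below is a minimal example, assuming that "Missing cells (%)" is the missing-cell count divided by variables × observations and that 1 MiB is 1024² bytes; the variable names are illustrative and not part of the report.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 31
n_observations = 20_000
missing_cells = 8_061
avg_record_bytes = 248.0

# Assumption: the percentage is taken over all cells (variables x observations).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: total size is the average record size times the row count, in MiB.
total_mib = avg_record_bytes * n_observations / 1024**2

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Total size in memory: {total_mib:.1f} MiB")
```

Both values round to the figures the report shows (1.3% and 4.7 MiB), which suggests the assumptions above are at least consistent with the table.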
Variable types
| Categorical | 15 |
|---|---|
| Numeric | 13 |
| Boolean | 3 |
Warnings
| Warning | Type |
|---|---|
| Dataset has 1 (< 0.1%) duplicate rows | Duplicates |
| qtde_contas_bancarias is highly correlated with qtde_contas_bancarias_especiais | High correlation |
| qtde_contas_bancarias_especiais is highly correlated with qtde_contas_bancarias | High correlation |
| tipo_residencia has 536 (2.7%) missing values | Missing |
| meses_na_residencia has 1450 (7.2%) missing values | Missing |
| profissao has 3097 (15.5%) missing values | Missing |
| ocupacao has 2978 (14.9%) missing values | Missing |
| renda_mensal_regular is highly skewed (γ1 = 67.75421325) | Skewed |
| renda_extra is highly skewed (γ1 = 137.4095781) | Skewed |
| valor_patrimonio_pessoal is highly skewed (γ1 = 126.6995194) | Skewed |
| meses_no_trabalho is highly skewed (γ1 = 63.19895877) | Skewed |
| inadimplente is uniformly distributed | Uniform |
| qtde_dependentes has 13350 (66.8%) zeros | Zeros |
| tipo_residencia has 331 (1.7%) zeros | Zeros |
| meses_na_residencia has 1858 (9.3%) zeros | Zeros |
| renda_extra has 18930 (94.7%) zeros | Zeros |
| valor_patrimonio_pessoal has 19072 (95.4%) zeros | Zeros |
| meses_no_trabalho has 19973 (99.9%) zeros | Zeros |
| profissao has 1398 (7.0%) zeros | Zeros |
| ocupacao has 1114 (5.6%) zeros | Zeros |
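Several of the entries above flag variables that are both zero-heavy and highly skewed (γ1 is the skewness coefficient). The sketch below illustrates how a few extreme values among many zeros produce a large γ1, and how a `log1p` transform reduces it. It is an illustration only: the sample data are hypothetical, and the function uses the plain population-moment formula, which may differ slightly from the estimator the profiling tool applies.

```python
import math

def skewness(values):
    """Population skewness: third central moment over variance ** 1.5."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    return m3 / m2 ** 1.5

# Hypothetical zero-heavy sample (not taken from the dataset):
# 95 zeros plus a handful of increasingly large values.
sample = [0.0] * 95 + [100.0, 200.0, 500.0, 1000.0, 50000.0]

raw_skew = skewness(sample)
log_skew = skewness([math.log1p(v) for v in sample])

print(f"raw skewness:   {raw_skew:.2f}")
print(f"log1p skewness: {log_skew:.2f}")
```

The transformed sample is still right-skewed because the zeros dominate, so for columns such as renda_extra a separate "is zero" indicator may be worth considering alongside any transform.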
Reproduction
| Analysis started | 2021-05-15 22:13:43.264149 |
|---|---|
| Analysis finished | 2021-05-15 22:14:39.973707 |
| Duration | 56.71 seconds |
| Software version | pandas-profiling v2.11.0 |
| Download configuration | config.yaml |
produto_solicitado
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 156.4 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 20000 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 7 |
| Value | Count | Frequency (%) |
| 1 | 17023 | 85.1% |
| 2 | 2435 | 12.2% |
| 7 | 542 | 2.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 17023 | |
| 2 | 2435 | 12.2% |
| 7 | 542 | 2.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 20000 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 1 | 17023 | |
| 2 | 2435 | 12.2% |
| 7 | 542 | 2.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 20000 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 1 | 17023 | |
| 2 | 2435 | 12.2% |
| 7 | 542 | 2.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20000 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 1 | 17023 | |
| 2 | 2435 | 12.2% |
| 7 | 542 | 2.7% |
dia_vencimento
Real number (ℝ≥0)
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.14725 |
|---|---|
| Minimum | 1 |
| Maximum | 25 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 156.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 10 |
| median | 10 |
| Q3 | 20 |
| 95-th percentile | 25 |
| Maximum | 25 |
| Range | 24 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 6.748506839 |
|---|---|
| Coefficient of variation (CV) | 0.5133017809 |
| Kurtosis | -0.7233846608 |
| Mean | 13.14725 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 0.441538168 |
| Sum | 262945 |
| Variance | 45.54234455 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
| 10 | 7847 | 39.2% |
| 15 | 3557 | 17.8% |
| 25 | 3089 | 15.4% |
| 5 | 2825 | 14.1% |
| 20 | 1952 | 9.8% |
| 1 | 730 | 3.6% |
Minimum 5 values
| Value | Count | Frequency (%) |
| 1 | 730 | 3.6% |
| 5 | 2825 | 14.1% |
| 10 | 7847 | 39.2% |
| 15 | 3557 | 17.8% |
| 20 | 1952 | 9.8% |
Maximum 5 values
| Value | Count | Frequency (%) |
| 25 | 3089 | 15.4% |
| 20 | 1952 | 9.8% |
| 15 | 3557 | 17.8% |
| 10 | 7847 | 39.2% |
| 5 | 2825 | 14.1% |
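Because dia_vencimento takes only six distinct values, its descriptive statistics can be recomputed directly from the value counts. The sketch below is a minimal check, assuming the reported variance uses the sample (n − 1) denominator; the variable names are illustrative.

```python
import math

# Value counts copied from the dia_vencimento frequency table.
counts = {1: 730, 5: 2825, 10: 7847, 15: 3557, 20: 1952, 25: 3089}

n = sum(counts.values())
total = sum(value * count for value, count in counts.items())
mean = total / n

# Assumption: sample variance with an (n - 1) denominator.
variance = sum(
    count * (value - mean) ** 2 for value, count in counts.items()
) / (n - 1)
std = math.sqrt(variance)
cv = std / mean  # coefficient of variation

print(n, total, mean)
print(round(variance, 6), round(std, 6), round(cv, 6))
```

Under that assumption the results agree with the Sum, Mean, Variance, Standard deviation and CV rows of the descriptive statistics table.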
forma_envio_solicitacao
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 156.4 KiB |
Length
| Max length | 10 |
|---|---|
| Median length | 8 |
| Mean length | 8.74145 |
| Min length | 7 |
Characters and Unicode
| Total characters | 174829 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | presencial |
|---|---|
| 2nd row | internet |
| 3rd row | internet |
| 4th row | internet |
| 5th row | internet |
| Value | Count | Frequency (%) |
| internet | 11264 | 56.3% |
| presencial | 7855 | 39.3% |
| correio | 881 | 4.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 39119 | |
| n | 30383 | |
| t | 22528 | |
| r | 20881 | |
| i | 20000 | |
| c | 8736 | 5.0% |
| p | 7855 | 4.5% |
| s | 7855 | 4.5% |
| a | 7855 | 4.5% |
| l | 7855 | 4.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 174829 |
Most frequent character per category
| Value | Count | Frequency (%) |
| e | 39119 | |
| n | 30383 | |
| t | 22528 | |
| r | 20881 | |
| i | 20000 | |
| c | 8736 | 5.0% |
| p | 7855 | 4.5% |
| s | 7855 | 4.5% |
| a | 7855 | 4.5% |
| l | 7855 | 4.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 174829 |
Most frequent character per script
| Value | Count | Frequency (%) |
| e | 39119 | |
| n | 30383 | |
| t | 22528 | |
| r | 20881 | |
| i | 20000 | |
| c | 8736 | 5.0% |
| p | 7855 | 4.5% |
| s | 7855 | 4.5% |
| a | 7855 | 4.5% |
| l | 7855 | 4.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 174829 |
Most frequent character per block
| Value | Count | Frequency (%) |
| e | 39119 | |
| n | 30383 | |
| t | 22528 | |
| r | 20881 | |
| i | 20000 | |
| c | 8736 | 5.0% |
| p | 7855 | 4.5% |
| s | 7855 | 4.5% |
| a | 7855 | 4.5% |
| l | 7855 | 4.5% |
tipo_endereco
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 156.4 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 20000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 19873 | 99.4% |
| 2 | 127 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 19873 | |
| 2 | 127 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 20000 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 1 | 19873 | |
| 2 | 127 | 0.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 20000 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 1 | 19873 | |
| 2 | 127 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20000 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 1 | 19873 | |
| 2 | 127 | 0.6% |
sexo
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 156.4 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 20000 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | M |
|---|---|
| 2nd row | F |
| 3rd row | F |
| 4th row | M |
| 5th row | F |
| Value | Count | Frequency (%) |
| F | 12246 | 61.2% |
| M | 7722 | 38.6% |
| N | 25 | 0.1% |
| (space) | 7 | < 0.1% |
| Value | Count | Frequency (%) |
| f | 12246 | |
| m | 7722 | |
| n | 25 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| F | 12246 | |
| M | 7722 | |
| N | 25 | 0.1% |
| (space) | 7 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 19993 | |
| Space Separator | 7 | < 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| F | 12246 | |
| M | 7722 | |
| N | 25 | 0.1% |
| Value | Count | Frequency (%) |
| (space) | 7 | |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 19993 | |
| Common | 7 | < 0.1% |
Most frequent character per script
| Value | Count | Frequency (%) |
| F | 12246 | |
| M | 7722 | |
| N | 25 | 0.1% |
| Value | Count | Frequency (%) |
| (space) | 7 | |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20000 |
Most frequent character per block
| Value | Count | Frequency (%) |
| F | 12246 | |
| M | 7722 | |
| N | 25 | 0.1% |
| (space) | 7 | < 0.1% |
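The sexo tables show four distinct values: F, M, N, and a blank (space) value that occurs 7 times. One possible cleanup is sketched below. It assumes, without confirmation from the report, that both "N" and the blank value mean "not informed" and can be treated as missing; the `normalize` helper is hypothetical.

```python
from collections import Counter

# Counts copied from the sexo frequency table; " " stands for the blank value.
raw_counts = Counter({"F": 12246, "M": 7722, "N": 25, " ": 7})

def normalize(value):
    """Keep F and M; map anything else to None.

    Assumption: 'N' and blank both mean 'not informed'.
    """
    cleaned = value.strip().upper()
    return cleaned if cleaned in {"F", "M"} else None

clean_counts = Counter()
for value, count in raw_counts.items():
    clean_counts[normalize(value)] += count

print(clean_counts)
```

With this mapping, 32 of the 20000 rows would end up as missing; whether that is appropriate depends on what "N" actually encodes in the source system.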
idade
Real number (ℝ≥0)
| Distinct | 84 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 42.3525 |
|---|---|
| Minimum | 7 |
| Maximum | 106 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 156.4 KiB |
Quantile statistics
| Minimum | 7 |
|---|---|
| 5-th percentile | 21 |
| Q1 | 31 |
| median | 40 |
| Q3 | 52 |
| 95-th percentile | 70 |
| Maximum | 106 |
| Range | 99 |
| Interquartile range (IQR) | 21 |
Descriptive statistics
| Standard deviation | 14.93017713 |
|---|---|
| Coefficient of variation (CV) | 0.3525217433 |
| Kurtosis | -0.210705359 |
| Mean | 42.3525 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 0.5584304521 |
| Sum | 847050 |
| Variance | 222.9101893 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
| 40 | 555 | 2.8% |
| 39 | 534 | 2.7% |
| 36 | 526 | 2.6% |
| 32 | 518 | 2.6% |
| 37 | 513 | 2.6% |
| 43 | 510 | 2.5% |
| 28 | 509 | 2.5% |
| 33 | 504 | 2.5% |
| 31 | 503 | 2.5% |
| 38 | 500 | 2.5% |
| Other values (74) | 14828 | 74.1% |
Minimum 5 values
| Value | Count | Frequency (%) |
| 7 | 1 | < 0.1% |
| 17 | 7 | < 0.1% |
| 18 | 265 | 1.3% |
| 19 | 260 | 1.3% |
| 20 | 293 | 1.5% |
Maximum 5 values
| Value | Count | Frequency (%) |
| 106 | 2 | < 0.1% |
| 100 | 1 | < 0.1% |
| 97 | 1 | < 0.1% |
| 96 | 2 | < 0.1% |
| 95 | 4 | < 0.1% |
estado_civil
Real number (ℝ≥0)
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.12085 |
|---|---|
| Minimum | 0 |
| Maximum | 7 |
| Zeros | 81 |
| Zeros (%) | 0.4% |
| Memory size | 156.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 2 |
| 95-th percentile | 5 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.33200375 |
|---|---|
| Coefficient of variation (CV) | 0.6280518423 |
| Kurtosis | 2.799170933 |
| Mean | 2.12085 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.76004596 |
| Sum | 42417 |
| Variance | 1.774233989 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
| 2 | 10088 | 50.4% |
| 1 | 6519 | 32.6% |
| 4 | 1573 | 7.9% |
| 6 | 763 | 3.8% |
| 5 | 522 | 2.6% |
| 3 | 234 | 1.2% |
| 7 | 220 | 1.1% |
| 0 | 81 | 0.4% |
Minimum 5 values
| Value | Count | Frequency (%) |
| 0 | 81 | 0.4% |
| 1 | 6519 | 32.6% |
| 2 | 10088 | 50.4% |
| 3 | 234 | 1.2% |
| 4 | 1573 | 7.9% |
Maximum 5 values
| Value | Count | Frequency (%) |
| 7 | 220 | 1.1% |
| 6 | 763 | 3.8% |
| 5 | 522 | 2.6% |
| 4 | 1573 | 7.9% |
| 3 | 234 | 1.2% |
qtde_dependentes
Real number (ℝ≥0)
| Distinct | 15 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.6664 |
|---|---|
| Minimum | 0 |
| Maximum | 53 |
| Zeros | 13350 |
| Zeros (%) | 66.8% |
| Memory size | 156.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 3 |
| Maximum | 53 |
| Range | 53 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.23672451 |
|---|---|
| Coefficient of variation (CV) | 1.855829097 |
| Kurtosis | 167.6045062 |
| Mean | 0.6664 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.925042325 |
| Sum | 13328 |
| Variance | 1.529487514 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
| 0 | 13350 | 66.8% |
| 1 | 2814 | 14.1% |
| 2 | 2189 | 10.9% |
| 3 | 1029 | 5.1% |
| 4 | 352 | 1.8% |
| 5 | 149 | 0.7% |
| 6 | 57 | 0.3% |
| 7 | 22 | 0.1% |
| 8 | 14 | 0.1% |
| 9 | 9 | < 0.1% |
| Other values (5) | 15 | 0.1% |
Minimum 5 values
| Value | Count | Frequency (%) |
| 0 | 13350 | 66.8% |
| 1 | 2814 | 14.1% |
| 2 | 2189 | 10.9% |
| 3 | 1029 | 5.1% |
| 4 | 352 | 1.8% |
Maximum 5 values
| Value | Count | Frequency (%) |
| 53 | 1 | < 0.1% |
| 14 | 1 | < 0.1% |
| 13 | 2 | < 0.1% |
| 11 | 4 | < 0.1% |
| 10 | 7 | < 0.1% |
nacionalidade
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 156.4 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 20000 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 19152 | 95.8% |
| 0 | 808 | 4.0% |
| 2 | 40 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 19152 | |
| 0 | 808 | 4.0% |
| 2 | 40 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 20000 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 1 | 19152 | |
| 0 | 808 | 4.0% |
| 2 | 40 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 20000 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 1 | 19152 | |
| 0 | 808 | 4.0% |
| 2 | 40 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20000 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 1 | 19152 | |
| 0 | 808 | 4.0% |
| 2 | 40 | 0.2% |
possui_telefone_residencial
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 19.7 KiB |
| Value | Count | Frequency (%) |
| True | 16474 | 82.4% |
| False | 3526 | 17.6% |
tipo_residencia
Real number (ℝ≥0)
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 536 |
| Missing (%) | 2.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.261302918 |
|---|---|
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 331 |
| Zeros (%) | 1.7% |
| Memory size | 156.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 3 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.8835795418 |
|---|---|
| Coefficient of variation (CV) | 0.7005292139 |
| Kurtosis | 11.37714604 |
| Mean | 1.261302918 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.408604224 |
| Sum | 24550 |
| Variance | 0.7807128068 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
| 1 | 16497 | 82.5% |
| 2 | 1635 | 8.2% |
| 5 | 827 | 4.1% |
| 0 | 331 | 1.7% |
| 4 | 126 | 0.6% |
| 3 | 48 | 0.2% |
| (Missing) | 536 | 2.7% |
Minimum 5 values
| Value | Count | Frequency (%) |
| 0 | 331 | 1.7% |
| 1 | 16497 | 82.5% |
| 2 | 1635 | 8.2% |
| 3 | 48 | 0.2% |
| 4 | 126 | 0.6% |
Maximum 5 values
| Value | Count | Frequency (%) |
| 5 | 827 | 4.1% |
| 4 | 126 | 0.6% |
| 3 | 48 | 0.2% |
| 2 | 1635 | 8.2% |
| 1 | 16497 | 82.5% |
meses_na_residencia
Real number (ℝ≥0)
| Distinct | 76 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 1450 |
| Missing (%) | 7.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.57245283 |
|---|---|
| Minimum | 0 |
| Maximum | 228 |
| Zeros | 1858 |
| Zeros (%) | 9.3% |
| Memory size | 156.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 6 |
| Q3 | 15 |
| 95-th percentile | 30 |
| Maximum | 228 |
| Range | 228 |
| Interquartile range (IQR) | 14 |
Descriptive statistics
| Standard deviation | 10.64958027 |
|---|---|
| Coefficient of variation (CV) | 1.112523661 |
| Kurtosis | 18.11111445 |
| Mean | 9.57245283 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 2.340849526 |
| Sum | 177569 |
| Variance | 113.4135599 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
| 1 | 2937 | 14.7% |
| 0 | 1858 | 9.3% |
| 10 | 1510 | 7.5% |
| 5 | 1486 | 7.4% |
| 2 | 1319 | 6.6% |
| 3 | 953 | 4.8% |
| 20 | 934 | 4.7% |
| 15 | 776 | 3.9% |
| 8 | 672 | 3.4% |
| 6 | 666 | 3.3% |
| Other values (66) | 5439 | 27.2% |
| (Missing) | 1450 | 7.2% |
Minimum 5 values
| Value | Count | Frequency (%) |
| 0 | 1858 | 9.3% |
| 1 | 2937 | 14.7% |
| 2 | 1319 | 6.6% |
| 3 | 953 | 4.8% |
| 4 | 643 | 3.2% |
Maximum 5 values
| Value | Count | Frequency (%) |
| 228 | 1 | < 0.1% |
| 200 | 1 | < 0.1% |
| 100 | 1 | < 0.1% |
| 96 | 1 | < 0.1% |
| 89 | 1 | < 0.1% |
possui_email
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 156.4 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 20000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 15984 | 79.9% |
| 0 | 4016 | 20.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 15984 | |
| 0 | 4016 | 20.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 20000 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 1 | 15984 | |
| 0 | 4016 | 20.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 20000 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 1 | 15984 | |
| 0 | 4016 | 20.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20000 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 1 | 15984 | |
| 0 | 4016 | 20.1% |
renda_mensal_regular
Real number (ℝ≥0)
| Distinct | 3031 |
|---|---|
| Distinct (%) | 15.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 957.1309375 |
|---|---|
| Minimum | 69 |
| Maximum | 959000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 156.4 KiB |
Quantile statistics
| Minimum | 69 |
|---|---|
| 5-th percentile | 289 |
| Q1 | 360 |
| median | 500 |
| Q3 | 800 |
| 95-th percentile | 1782.05 |
| Maximum | 959000 |
| Range | 958931 |
| Interquartile range (IQR) | 440 |
Descriptive statistics
| Standard deviation | 11353.965 |
|---|---|
| Coefficient of variation (CV) | 11.86249922 |
| Kurtosis | 5062.489381 |
| Mean | 957.1309375 |
| Median Absolute Deviation (MAD) | 150 |
| Skewness | 67.75421325 |
| Sum | 19142618.75 |
| Variance | 128912521.2 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
| 350 | 2808 | 14.0% |
| 500 | 628 | 3.1% |
| 400 | 579 | 2.9% |
| 380 | 546 | 2.7% |
| 600 | 513 | 2.6% |
| 700 | 419 | 2.1% |
| 800 | 388 | 1.9% |
| 450 | 340 | 1.7% |
| 300 | 337 | 1.7% |
| 1000 | 248 | 1.2% |
| Other values (3021) | 13194 | 66.0% |
Minimum 5 values
| Value | Count | Frequency (%) |
| 69 | 1 | < 0.1% |
| 100 | 5 | < 0.1% |
| 105 | 1 | < 0.1% |
| 115 | 1 | < 0.1% |
| 120 | 5 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
| 959000 | 1 | < 0.1% |
| 875000 | 1 | < 0.1% |
| 668000 | 1 | < 0.1% |
| 486778 | 1 | < 0.1% |
| 174274 | 1 | < 0.1% |
renda_extra
Real number (ℝ≥0)
| Distinct | 284 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 39.0969585 |
|---|---|
| Minimum | 0 |
| Maximum | 194344 |
| Zeros | 18930 |
| Zeros (%) | 94.7% |
| Memory size | 156.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 100 |
| Maximum | 194344 |
| Range | 194344 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1387.42878 |
|---|---|
| Coefficient of variation (CV) | 35.48687247 |
| Kurtosis | 19237.66506 |
| Mean | 39.0969585 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 137.4095781 |
| Sum | 781939.17 |
| Variance | 1924958.62 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
| 0 | 18930 | 94.7% |
| 350 | 136 | 0.7% |
| 600 | 61 | 0.3% |
| 300 | 58 | 0.3% |
| 400 | 57 | 0.3% |
| 200 | 57 | 0.3% |
| 500 | 53 | 0.3% |
| 800 | 31 | 0.2% |
| 250 | 29 | 0.1% |
| 150 | 25 | 0.1% |
| Other values (274) | 563 | 2.8% |
Minimum 5 values
| Value | Count | Frequency (%) |
| 0 | 18930 | 94.7% |
| 1 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 15 | 2 | < 0.1% |
| 31.48 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
| 194344 | 1 | < 0.1% |
| 10200 | 1 | < 0.1% |
| 8341 | 1 | < 0.1% |
| 5400 | 1 | < 0.1% |
| 5000 | 1 | < 0.1% |
possui_cartao_visa
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 156.4 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 20000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 17822 | 89.1% |
| 1 | 2178 | 10.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 17822 | |
| 1 | 2178 | 10.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 20000 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 17822 | |
| 1 | 2178 | 10.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 20000 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 17822 | |
| 1 | 2178 | 10.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20000 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 17822 | |
| 1 | 2178 | 10.9% |
possui_cartao_mastercard
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 156.4 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 20000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 18101 | 90.5% |
| 1 | 1899 | 9.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 18101 | |
| 1 | 1899 | 9.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 20000 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 18101 | |
| 1 | 1899 | 9.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 20000 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 18101 | |
| 1 | 1899 | 9.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20000 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 18101 | |
| 1 | 1899 | 9.5% |
possui_cartao_diners
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 156.4 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 20000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 19968 | 99.8% |
| 1 | 32 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 19968 | |
| 1 | 32 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 20000 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 19968 | |
| 1 | 32 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 20000 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 19968 | |
| 1 | 32 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20000 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 19968 | |
| 1 | 32 | 0.2% |
possui_cartao_amex
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 156.4 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 20000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 19959 | 99.8% |
| 1 | 41 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 19959 | |
| 1 | 41 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 20000 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 19959 | |
| 1 | 41 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 20000 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 19959 | |
| 1 | 41 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20000 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 19959 | |
| 1 | 41 | 0.2% |
possui_outros_cartoes
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 156.4 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 20000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 19955 | 99.8% |
| 1 | 45 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 19955 | |
| 1 | 45 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 20000 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 19955 | |
| 1 | 45 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 20000 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 19955 | |
| 1 | 45 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20000 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 19955 | |
| 1 | 45 | 0.2% |
qtde_contas_bancarias
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 156.4 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 20000 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 12786 | 63.9% |
| 1 | 7206 | 36.0% |
| 2 | 8 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 12786 | |
| 1 | 7206 | |
| 2 | 8 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 20000 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 12786 | |
| 1 | 7206 | |
| 2 | 8 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 20000 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 12786 | |
| 1 | 7206 | |
| 2 | 8 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20000 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 12786 | |
| 1 | 7206 | |
| 2 | 8 | < 0.1% |
qtde_contas_bancarias_especiais
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 156.4 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 20000 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 12786 | 63.9% |
| 1 | 7206 | 36.0% |
| 2 | 8 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 12786 | |
| 1 | 7206 | |
| 2 | 8 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 20000 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 12786 | |
| 1 | 7206 | |
| 2 | 8 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 20000 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 12786 | |
| 1 | 7206 | |
| 2 | 8 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20000 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 12786 | |
| 1 | 7206 | |
| 2 | 8 | < 0.1% |
valor_patrimonio_pessoal
Real number (ℝ≥0)
| Distinct | 94 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2095.614 |
|---|---|
| Minimum | 0 |
| Maximum | 6000000 |
| Zeros | 19072 |
| Zeros (%) | 95.4% |
| Memory size | 156.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 6000000 |
| Range | 6000000 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 44033.43658 |
|---|---|
| Coefficient of variation (CV) | 21.01218859 |
| Kurtosis | 17218.03756 |
| Mean | 2095.614 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 126.6995194 |
| Sum | 41912280 |
| Variance | 1938943537 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 19072 | 95.4% |
| 25000 | 87 | 0.4% |
| 30000 | 86 | 0.4% |
| 20000 | 83 | 0.4% |
| 50000 | 71 | 0.4% |
| 15000 | 66 | 0.3% |
| 35000 | 63 | 0.3% |
| 40000 | 48 | 0.2% |
| 45000 | 39 | 0.2% |
| 60000 | 37 | 0.2% |
| Other values (84) | 348 | 1.7% |
| Value | Count | Frequency (%) |
| 0 | 19072 | 95.4% |
| 7 | 1 | < 0.1% |
| 15 | 1 | < 0.1% |
| 17 | 1 | < 0.1% |
| 18 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 6000000 | 1 | < 0.1% |
| 600000 | 1 | < 0.1% |
| 450000 | 1 | < 0.1% |
| 320000 | 1 | < 0.1% |
| 250000 | 2 | < 0.1% |
possui_carro
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 156.4 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 20000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 0 | 13219 | 66.1% |
| 1 | 6781 | 33.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 13219 | 66.1% |
| 1 | 6781 | 33.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 20000 | 100.0% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 13219 | 66.1% |
| 1 | 6781 | 33.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 20000 | 100.0% |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 13219 | 66.1% |
| 1 | 6781 | 33.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20000 | 100.0% |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 13219 | 66.1% |
| 1 | 6781 | 33.9% |
vinculo_formal_com_empresa
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 19.7 KiB |
| Value | Count | Frequency (%) |
| False | 11174 | 55.9% |
| True | 8826 | 44.1% |
possui_telefone_trabalho
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 19.7 KiB |
| Value | Count | Frequency (%) |
| False | 14519 | 72.6% |
| True | 5481 | 27.4% |
meses_no_trabalho
Real number (ℝ≥0)
| Distinct | 13 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.0089 |
|---|---|
| Minimum | 0 |
| Maximum | 32 |
| Zeros | 19973 |
| Zeros (%) | 99.9% |
| Memory size | 156.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 32 |
| Range | 32 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.3888808962 |
|---|---|
| Coefficient of variation (CV) | 43.69448272 |
| Kurtosis | 4536.037419 |
| Mean | 0.0089 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 63.19895877 |
| Sum | 178 |
| Variance | 0.1512283514 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 19973 | 99.9% |
| 1 | 7 | < 0.1% |
| 3 | 4 | < 0.1% |
| 2 | 4 | < 0.1% |
| 6 | 2 | < 0.1% |
| 5 | 2 | < 0.1% |
| 4 | 2 | < 0.1% |
| 15 | 1 | < 0.1% |
| 30 | 1 | < 0.1% |
| 14 | 1 | < 0.1% |
| Other values (3) | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 19973 | 99.9% |
| 1 | 7 | < 0.1% |
| 2 | 4 | < 0.1% |
| 3 | 4 | < 0.1% |
| 4 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 32 | 1 | < 0.1% |
| 30 | 1 | < 0.1% |
| 18 | 1 | < 0.1% |
| 15 | 1 | < 0.1% |
| 14 | 1 | < 0.1% |
profissao
Real number (ℝ≥0)
| Distinct | 18 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 3097 |
| Missing (%) | 15.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.045080755 |
|---|---|
| Minimum | 0 |
| Maximum | 17 |
| Zeros | 1398 |
| Zeros (%) | 7.0% |
| Memory size | 156.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 9 |
| median | 9 |
| Q3 | 9 |
| 95-th percentile | 11 |
| Maximum | 17 |
| Range | 17 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 3.210790149 |
|---|---|
| Coefficient of variation (CV) | 0.3990998035 |
| Kurtosis | 1.645508794 |
| Mean | 8.045080755 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -1.485432434 |
| Sum | 135986 |
| Variance | 10.30917338 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 9 | 12103 | 60.5% |
| 0 | 1398 | 7.0% |
| 11 | 1349 | 6.7% |
| 2 | 1171 | 5.9% |
| 12 | 192 | 1.0% |
| 10 | 173 | 0.9% |
| 16 | 126 | 0.6% |
| 13 | 125 | 0.6% |
| 7 | 89 | 0.4% |
| 8 | 61 | 0.3% |
| Other values (8) | 116 | 0.6% |
| (Missing) | 3097 | 15.5% |
| Value | Count | Frequency (%) |
| 0 | 1398 | 7.0% |
| 1 | 1 | < 0.1% |
| 2 | 1171 | 5.9% |
| 3 | 7 | < 0.1% |
| 4 | 13 | 0.1% |
| Value | Count | Frequency (%) |
| 17 | 16 | 0.1% |
| 16 | 126 | 0.6% |
| 15 | 25 | 0.1% |
| 14 | 6 | < 0.1% |
| 13 | 125 | 0.6% |
ocupacao
Real number (ℝ≥0)
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2978 |
| Missing (%) | 14.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.533309834 |
|---|---|
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 1114 |
| Zeros (%) | 5.6% |
| Memory size | 156.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.532765217 |
|---|---|
| Coefficient of variation (CV) | 0.6050445137 |
| Kurtosis | -1.064418569 |
| Mean | 2.533309834 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.3443705658 |
| Sum | 43122 |
| Variance | 2.34936921 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 6882 | 34.4% |
| 1 | 3144 | 15.7% |
| 4 | 2924 | 14.6% |
| 5 | 2822 | 14.1% |
| 0 | 1114 | 5.6% |
| 3 | 136 | 0.7% |
| (Missing) | 2978 | 14.9% |
| Value | Count | Frequency (%) |
| 0 | 1114 | 5.6% |
| 1 | 3144 | 15.7% |
| 2 | 6882 | 34.4% |
| 3 | 136 | 0.7% |
| 4 | 2924 | 14.6% |
| Value | Count | Frequency (%) |
| 5 | 2822 | 14.1% |
| 4 | 2924 | 14.6% |
| 3 | 136 | 0.7% |
| 2 | 6882 | 34.4% |
| 1 | 3144 | 15.7% |
local_onde_reside
Real number (ℝ≥0)
| Distinct | 743 |
|---|---|
| Distinct (%) | 3.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 581.29525 |
|---|---|
| Minimum | 105 |
| Maximum | 999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 156.4 KiB |
Quantile statistics
| Minimum | 105 |
|---|---|
| 5-th percentile | 148 |
| Q1 | 444 |
| median | 596 |
| Q3 | 728 |
| 95-th percentile | 956 |
| Maximum | 999 |
| Range | 894 |
| Interquartile range (IQR) | 284 |
Descriptive statistics
| Standard deviation | 227.369798 |
|---|---|
| Coefficient of variation (CV) | 0.3911433957 |
| Kurtosis | -0.5758248011 |
| Mean | 581.29525 |
| Median Absolute Deviation (MAD) | 144 |
| Skewness | -0.2500355883 |
| Sum | 11625905 |
| Variance | 51697.02503 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 960 | 367 | 1.8% |
| 591 | 345 | 1.7% |
| 570 | 310 | 1.6% |
| 456 | 256 | 1.3% |
| 628 | 249 | 1.2% |
| 685 | 222 | 1.1% |
| 596 | 205 | 1.0% |
| 689 | 196 | 1.0% |
| 619 | 194 | 1.0% |
| 581 | 189 | 0.9% |
| Other values (733) | 17467 | 87.3% |
| Value | Count | Frequency (%) |
| 105 | 1 | < 0.1% |
| 110 | 11 | 0.1% |
| 112 | 1 | < 0.1% |
| 113 | 83 | 0.4% |
| 114 | 46 | 0.2% |
| Value | Count | Frequency (%) |
| 999 | 2 | < 0.1% |
| 998 | 1 | < 0.1% |
| 997 | 4 | < 0.1% |
| 996 | 2 | < 0.1% |
| 995 | 8 | < 0.1% |
inadimplente
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 156.4 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 20000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 0 | 10000 | 50.0% |
| 1 | 10000 | 50.0% |
| Value | Count | Frequency (%) |
| 1 | 10000 | 50.0% |
| 0 | 10000 | 50.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 10000 | 50.0% |
| 1 | 10000 | 50.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 20000 | 100.0% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 10000 | 50.0% |
| 1 | 10000 | 50.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 20000 | 100.0% |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 10000 | 50.0% |
| 1 | 10000 | 50.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20000 | 100.0% |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 10000 | 50.0% |
| 1 | 10000 | 50.0% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
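As a minimal sketch of the calculation described above, the plain-Python function below divides the covariance by the product of the standard deviations. The name `pearson_r` and the toy data are illustrative assumptions; they are not taken from the profiled dataset or from the report's own implementation.

```python
import math


def pearson_r(x, y):
    """Pearson's r: covariance of x and y over the product of their standard deviations."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # The 1/n factors of the covariance and of the two variances cancel,
    # so plain sums of (cross-)deviations are enough.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


# Hypothetical toy data: y is an exact increasing linear function of x.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.0, 6.0, 8.0, 10.0]

r = pearson_r(x, y)                                 # perfectly linear, increasing
r_shifted = pearson_r([10 * a + 3 for a in x], y)   # x rescaled and shifted
r_negative = pearson_r(x, [-b for b in y])          # perfectly linear, decreasing
```

Rescaling and shifting `x` leaves the coefficient unchanged, which illustrates the invariance under changes in location and scale.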
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
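The rank-based calculation can be sketched in plain Python as follows: each variable is replaced by its ranks (ties receive the average rank), and Pearson's formula is applied to the ranks. The helper names `average_ranks`, `pearson_r` and `spearman_rho` and the toy data are illustrative assumptions, not the report's implementation.

```python
import math


def average_ranks(values):
    """1-based ranks; tied values share the average of the ranks they span."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson_r(x, y):
    """Covariance over the product of standard deviations (1/n factors cancel)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / math.sqrt(vx * vy)


def spearman_rho(x, y):
    """Spearman's rho: Pearson's r computed on the rank variables."""
    return pearson_r(average_ranks(x), average_ranks(y))


# Hypothetical toy data: y = x**3 is monotonic in x but not linear.
x = [1, 2, 3, 4, 5]
y = [a ** 3 for a in x]

rho = spearman_rho(x, y)   # monotonic relation, so the ranks agree exactly
r = pearson_r(x, y)        # below 1 because the relation is not linear
tied = average_ranks([10, 20, 20, 30])
```

On this toy data ρ reaches 1 while Pearson's r stays below 1, which is the sense in which ρ captures nonlinear monotonic relationships.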
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
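The pair-counting definition above can be sketched directly in plain Python. This is the simple variant that divides by the total number of pairs (often called tau-a); implementations that correct for ties may return different values when ties are present. The function name `kendall_tau_a` and the toy data are illustrative assumptions.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    concordant = 0
    discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        sign = (x1 - x2) * (y1 - y2)
        if sign > 0:
            concordant += 1      # both variables move in the same direction
        elif sign < 0:
            discordant += 1      # the variables move in opposite directions
        # sign == 0 is a tie in x or y; it counts toward neither group
    n = len(x)
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs


# Hypothetical toy data: one of the six pairs is out of order.
x = [1, 2, 3, 4]
y = [1, 3, 2, 4]

tau = kendall_tau_a(x, y)   # 5 concordant, 1 discordant, 6 pairs in total
```

Counting pairs this way is quadratic in the number of observations, which is acceptable for a sketch but slow on large columns.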
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
First rows
| | produto_solicitado | dia_vencimento | forma_envio_solicitacao | tipo_endereco | sexo | idade | estado_civil | qtde_dependentes | nacionalidade | possui_telefone_residencial | tipo_residencia | meses_na_residencia | possui_email | renda_mensal_regular | renda_extra | possui_cartao_visa | possui_cartao_mastercard | possui_cartao_diners | possui_cartao_amex | possui_outros_cartoes | qtde_contas_bancarias | qtde_contas_bancarias_especiais | valor_patrimonio_pessoal | possui_carro | vinculo_formal_com_empresa | possui_telefone_trabalho | meses_no_trabalho | profissao | ocupacao | local_onde_reside | inadimplente |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 10 | presencial | 1 | M | 85 | 2 | 0 | 1 | Y | 1.0 | 12.0 | 0 | 480.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0.0 | 1 | N | N | 0 | 9.0 | 1.0 | 600.0 | 0 |
| 1 | 1 | 25 | internet | 1 | F | 38 | 1 | 0 | 1 | Y | 1.0 | 5.0 | 1 | 380.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.0 | 0 | N | N | 0 | 2.0 | 5.0 | 492.0 | 0 |
| 2 | 1 | 20 | internet | 1 | F | 37 | 2 | 0 | 1 | Y | 5.0 | 1.0 | 1 | 600.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.0 | 0 | N | N | 0 | NaN | NaN | 450.0 | 1 |
| 3 | 1 | 20 | internet | 1 | M | 37 | 1 | 1 | 1 | Y | 1.0 | 1.0 | 1 | 460.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.0 | 0 | Y | Y | 0 | 9.0 | 2.0 | 932.0 | 1 |
| 4 | 7 | 1 | internet | 1 | F | 51 | 1 | 3 | 1 | Y | 0.0 | 1.0 | 1 | 687.0 | 600.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.0 | 1 | Y | N | 0 | 9.0 | 5.0 | 440.0 | 1 |
| 5 | 1 | 20 | presencial | 1 | M | 21 | 1 | 1 | 1 | Y | 5.0 | 2.0 | 0 | 382.0 | 0.0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0.0 | 1 | Y | Y | 0 | 9.0 | 2.0 | 628.0 | 1 |
| 6 | 1 | 15 | presencial | 1 | F | 64 | 4 | 2 | 1 | Y | 1.0 | 0.0 | 1 | 350.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0.0 | 1 | N | N | 0 | 10.0 | 1.0 | 190.0 | 1 |
| 7 | 1 | 5 | internet | 1 | F | 20 | 1 | 0 | 1 | Y | 1.0 | 5.0 | 1 | 800.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.0 | 0 | N | N | 0 | NaN | NaN | 299.0 | 1 |
| 8 | 2 | 25 | internet | 1 | F | 39 | 2 | 2 | 1 | Y | 1.0 | 3.0 | 1 | 1200.0 | 0.0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0.0 | 0 | Y | Y | 0 | 9.0 | 2.0 | 756.0 | 0 |
| 9 | 1 | 10 | presencial | 1 | M | 44 | 2 | 2 | 1 | N | 1.0 | 15.0 | 0 | 749.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0.0 | 1 | Y | N | 0 | 9.0 | 2.0 | 960.0 | 1 |
Last rows
| | produto_solicitado | dia_vencimento | forma_envio_solicitacao | tipo_endereco | sexo | idade | estado_civil | qtde_dependentes | nacionalidade | possui_telefone_residencial | tipo_residencia | meses_na_residencia | possui_email | renda_mensal_regular | renda_extra | possui_cartao_visa | possui_cartao_mastercard | possui_cartao_diners | possui_cartao_amex | possui_outros_cartoes | qtde_contas_bancarias | qtde_contas_bancarias_especiais | valor_patrimonio_pessoal | possui_carro | vinculo_formal_com_empresa | possui_telefone_trabalho | meses_no_trabalho | profissao | ocupacao | local_onde_reside | inadimplente |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 19990 | 1 | 10 | presencial | 1 | F | 52 | 4 | 0 | 1 | N | 1.0 | 0.0 | 0 | 350.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0.0 | 1 | N | N | 0 | 0.0 | 1.0 | 872.0 | 1 |
| 19991 | 1 | 10 | presencial | 1 | M | 48 | 2 | 2 | 1 | N | 1.0 | 6.0 | 0 | 1308.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0.0 | 1 | N | N | 0 | 0.0 | 1.0 | 351.0 | 1 |
| 19992 | 1 | 5 | internet | 1 | M | 62 | 4 | 0 | 1 | Y | 1.0 | 30.0 | 1 | 358.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.0 | 0 | N | N | 0 | 9.0 | 1.0 | 230.0 | 0 |
| 19993 | 1 | 5 | internet | 1 | F | 18 | 1 | 0 | 1 | Y | 1.0 | 6.0 | 1 | 405.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.0 | 0 | N | N | 0 | 9.0 | 2.0 | 289.0 | 0 |
| 19994 | 2 | 5 | presencial | 1 | M | 23 | 2 | 0 | 1 | Y | 1.0 | 23.0 | 0 | 350.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0.0 | 1 | N | N | 0 | 0.0 | 0.0 | 457.0 | 1 |
| 19995 | 1 | 10 | presencial | 1 | M | 27 | 2 | 0 | 1 | Y | 2.0 | 0.0 | 1 | 423.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0.0 | 1 | Y | N | 0 | 9.0 | 1.0 | 308.0 | 0 |
| 19996 | 1 | 20 | presencial | 1 | F | 26 | 2 | 1 | 1 | Y | 1.0 | 3.0 | 0 | 350.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0.0 | 1 | Y | N | 0 | 9.0 | 2.0 | 639.0 | 0 |
| 19997 | 1 | 10 | internet | 1 | F | 63 | 2 | 0 | 1 | Y | 5.0 | 25.0 | 1 | 321.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.0 | 0 | N | N | 0 | 9.0 | 1.0 | 486.0 | 0 |
| 19998 | 1 | 5 | internet | 1 | F | 84 | 1 | 0 | 1 | N | 1.0 | 30.0 | 1 | 380.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.0 | 0 | N | N | 0 | NaN | NaN | 590.0 | 0 |
| 19999 | 2 | 20 | presencial | 1 | F | 53 | 1 | 0 | 1 | Y | 1.0 | 11.0 | 1 | 300.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0.0 | 1 | N | N | 0 | 9.0 | 5.0 | 132.0 | 0 |
Most frequent
| | produto_solicitado | dia_vencimento | forma_envio_solicitacao | tipo_endereco | sexo | idade | estado_civil | qtde_dependentes | nacionalidade | possui_telefone_residencial | tipo_residencia | meses_na_residencia | possui_email | renda_mensal_regular | renda_extra | possui_cartao_visa | possui_cartao_mastercard | possui_cartao_diners | possui_cartao_amex | possui_outros_cartoes | qtde_contas_bancarias | qtde_contas_bancarias_especiais | valor_patrimonio_pessoal | possui_carro | vinculo_formal_com_empresa | possui_telefone_trabalho | meses_no_trabalho | profissao | ocupacao | local_onde_reside | inadimplente | count |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2 | 10 | internet | 1 | M | 35 | 1 | 0 | 1 | Y | 1.0 | 5.0 | 1 | 620.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.0 | 0 | Y | Y | 0 | 9.0 | 5.0 | 380.0 | 1 | 2 |